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We investigate the potential of scale- free networks as error-correcting codes. We find tliat irregular 
low-density parity-check codes with highest performance known to date have degree distributions 
well fitted by a power-law function p{k) ~ fe~^ with 7 close to 2, which suggests that codes built 
on scale-free networks with appropriate power exponents can be good error- correcting codes, with 
performance possibly approaching the Shannon limit. We demonstrate for an erasure channel that 
codes with power-law degree distribution of the form p{k) = C{k + a) with k > 2 and suitable 
selection of the parameters a and 7, indeed have very good error-correction capabilities. 

PACS numbers: 89.75.Hc, 89.70.-l-c, 89.75.Fb 



A variety of complex networks [l| exhibit a topological 
structure in which the connectivity between their con- 
stituent nodes follows a simple power law. Examples 
of such scale-free networks include the Internet 0, 
the World Wide Web (WWW) social networks [|, 

metabolic networks 0, etc. Extensive studies have been 
made to understand the topological features and evolv- 
ing dynamics 0, 0, ^3 of these networks. While many 
intriguing properties concerning the structural aspect of 
complex networks have been revealed thanks to these ef- 
forts, relatively little has been known about the effects of 
specific connectivity structures on networks' functional 
behavior In order to properly operate under a cer- 
tain environment or, in a more active sense, to success- 
fully accomplish a given task, a complex network may 
favor one particular structure over another. For prac- 
tical applications, it now appears that more attention 
need be paid to the functional aspect of these complex 
networks viewed as whole systems or organisms working 
for particular purposes. 

Recent advances in channel coding theory have led 
to the perception that the state-of-the-art capacity- 
approaching codes, such as Turbo code s a nd low- 
density parity-check (LDPC) codes [T^ Il4 Il5l|. can be 
understood in terms of graphs (or networks) consisting 
of nodes and edges The function of these graphs is 
to carry out error correction, i.e. to recover original data 
transmitted over noisy channels, by iteratively passing 
certain messages through edges connecting neighboring 
nodes. The art of developing a high-performance error- 
correcting code lies in designing a connectivity structure 
of a graph in such a way as to make the code built on 
it perform a desired function. One very important is- 
sue concerns finding the connectivity distribution that 
achieves the Shannon's capacity limit. Most attempts, 
however, have been limited to numerical optimization, 
and a complete understanding of the connectivity struc- 
ture specific to capacity-achieving codes is still lacking. 
Inspired by the ubiquitous nature of scale-free networks, 
one may ask whether the connectivity structure of scale- 
free networks could offer any insight into seeking good 
graph-based codes. 

In this paper we address the question whether scale- 



free networks whose connectivity distribution follows 
a power law can function effectively as good error- 
correcting codes. The codes built on scale-free networks 
considered here are basically LDPC codes in that the as- 
sociated parity-check matrices are sparse and that the 
belief propagation algorithm |13L flil Il5l | is employed for 
decoding. We first show that the degree distributions 
of LDPC codes with highest performance known to date 
are well fitted by power-law functions. Motivated by this 
finding, we generate a degree distribution according to 
the function p{k) — C{k + a)^'^ and fine tune the pa- 
rameters a and 7 to maximize the code's performance. 
We investigate error-correction capability of these codes 
over a binary erasure channel and compare them with the 
Tornado code 0|, the first commercialized LDPC code. 

An LDPC code can be represented by a bipartite graph 
in which there are two different types of nodes: variable 
nodes and check nodes. Nodes of one type are connected 
by edges only to nodes of the other type. Variable nodes 
are associated with data bits, and check nodes exam- 
ine whether the variable nodes connected to them sat- 
isfy parity-check equations. Error correction of corrupted 
data bits is performed by passing certain messages, e.g. 
likelihood ratios, through edges back and forth between 
variable and check nodes. It is known from the density 
evolution analysis |l5j | that, under the assumption of a 
tree-structured random graph with no closed loops, the 
error-correction capability of a code is solely determined 
by the degree distribution. 

We begin by inspecting the degree distributions of 
some high-performance LDPC codes. Figure QJa) shows 
the variable-node degree distribution of the LDPC code 
designed by Chung et al. which has been optimized 
for an additive white Gaussian noise channel and ap- 
proaches the Shannon limit within 0.0045 dB, presently 
the world record. Here, in order to obtain a meaningful 
distribution from the irregularly spaced data X{ki) in Ta- 
blc^lof Chung et al. 0, we took a local average over a 
bin of length (fc^+i — fci_i)/2: 

= ^^t^ W9 ' 

- fci-i)/2 

where P{ki) is the fraction of nodes with degree ki and is 
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FIG. 1: Degree distributions for high-performance LDPC 
codes, (a) The LDPC code designed by Chung et al. fTablellTI 
of (b) The Tornado code with maximum degree dmax = 

610 |l l ll9 | . The best fitting lines have slopes (a) 7 ~ 2.14 
and (b) 7 ~ 2.02. 



given by P(fci) = CX{ki)/ki, in which X{ki) is the fraction 
of edges connected to a variable node of degree ki and C 
is a normahzation constant. It can be seen from Fig.^a) 
that the degree distribution is well fitted by a power-law 
function p{k) ~ k~^ with 7 ~ 2.14. A more dramatic 
correspondence is observed for the variable-node degree 
distribution of the Tornado code [itI , as apparently 
seen in Fig.^b). The Tornado code has been optimized 
for an erasure channel and has a Poisson distribution for 
its check-node degree distribution. The power-law func- 
tion that best fits the variable-node degree distribution 
is found to have an exponent 7 ~ 2.02, and the fitting 
appears to be nearly perfect for large k. We also find 
that the right-regular sequence of Shokrollahi that 
slightly beats the Tornado sequence is well fitted by a 
similar power-law function. 

The fact that many high-quality LDPC codes have de- 
gree distributions well fitted by power-law functions stim- 
ulates us to test scale-free networks as error-correcting 
codes. Following the approach of Newman et al. [20j . 
we write the generating function for a general scale-free 
network in the form: 

G{x) = + E ?'(^)^'' (2) 

fc=0 k=di 

where the fraction of nodes with degree k > di follows 
a power law p(k) — Ck^'^, and the terms Uk with low 
degree k < di are separated from the second sum to al- 
low for possible deviation from the power law for small 
k, as is often the case for general scale-free networks. 
The generating function in Eq. (|2J), however, contains too 
many parameters to be amenable to numerical optimiza- 
tion unless di is sufficiently small. To reduce the number 
of parameters while still retaining the possibility that the 
distribution for low degrees may not obey an exact power 



law, we instead choose the following generating function: 

fe=0 

where p{k) — C{k + a) . If a < (a > 0), the degree 
distribution for small k lies above (below) the power-law 
function k~^ . We henceforth use Eq. Q to generate a 
variable-node degree distribution of our code and opti- 
mize the parameters a and 7 to achieve the best perfor- 
mance. 

Some empirical results known about LDPC codes help 
us to further refine our code. The most well known find- 
ings related to features of good LDPC codes may be that 
the variable nodes of degree one should be removed since 
they do not contribute to error correction and that the 
codes with almost uniform check-node degree yield good 
performance [Tsl 21] . Taking these into account, we 
let the sum in Eq. Q start from fc = 2, and restrict 
the check-node degree to two consecutive integers: the 
generating function for the check-node degree is written 
as F{x) = bx^ -I- (1 — 6)x*"'""'^, where the parameters b 
and i are easily determined once a variable-node degree 
distribution is selected. This choice of the check-node 
degree distribution enables us to design a code without 
restrictions on dmax for any given code rate; this prop- 
erty, however, is not shared by the right-regular sequence 
[isi l for which dmax is allowed to have only a special set 
of values. 

The performance of an LDPC code over a binary era- 
sure channel can be evaluated by the density evolution 
method |l^ as follows. Let 5 be the erasure probability 
of a given channel, and consider a code with a degree dis- 
tribution pair A(a;) = 'Y^\kX^~'^ and p{x) = ^pfcX*"'"^, 
where Afc [pk) is the fraction of edges connected to a 
variable (check) node of degree k. Note that the distri- 
bution here is defined in terms of the fraction of edges, 
not the fraction of nodes as before. Then, if the belief 
propagation algorithm is used for decoding, the messages 
passed between the variable and check nodes are known 
to evolve as [TsIIitI 

xi^xa\{l- p{l-xi^i)), (4) 

where xi denotes the expected fraction of erasure mes- 
sages at the ^th iteration and is its initial value given 
by xq = 5. The recovery of original data is successfully 
done if xi converges to zero. The threshold 5* , defined by 
the supremum of all 5 that result in successful decoding, 
tells the code's performance. For a given code rate i?, 
the threshold is upper bounded by 1 — i? [l^ . 

With the help of the above density evolution method, 
we calculate the error-correction capability of the scale- 
free networks given in the form of Eq. (|3J). The results are 
shown in Fig. |3 as a function of the maximum variable- 
node degree dmax, where the code rate is fixed at i? = 0.5. 
It is seen that the threshold erasure probability i5* in- 
creases as the maximum variable-node degree increases. 
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FIG. 2: Error-correction capability of optimized scale- free 
networks over a binary erasure channel. The code rate is 
R = 0.5. 
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FIG. 3: Iteration numbers of optimized scale- free networks 
(solid curve) and the Tornado code (dashed curve) for max- 
imum degrees (a) dmax = 610 and (b) dmax = 1009. The 
criterion for convergence of decoding is set to be < 1 x 10~^. 



For large dmax, the threshold almost reaches the the- 
oretical upper bound 1 — i?, indicating that the error- 
correction capability of our code is very good. For com- 
parison, we have also studied the error-correction capa- 
bility of codes that have degree distributions other than 
the power-law distribution, namely the exponential dis- 
tribution of the form p{k) ^ e"'^'-'^"'""-' and the Gaussian 
distribution of the form p{k) ^ e~Pik+a) _ -y^g ^^^^ that 
the threshold for the exponential distribution rapidly in- 
creases with dmax and converges to 0.465, a value much 
lower than the threshold for the power-law distribution. 
The case for the Gaussian distribution is observed to ex- 
hibit a similar behavior with a similar, low convergence 
limit. 

To more clearly demonstrate the high performance of 
codes on scale-free networks, we compare them with the 
Tornado code fvA . The threshold of our code is pre- 



TABLE L Error-correction capabilities of scale-free networks 
(SFN) and the Tornado code [l^ of rate R = 0.5, and pa- 
rameters 7 and a optimizing the SFN. The optimization was 
done by using a direction set method. 
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sented in Tabled along with the parameters a and 7 that 
maximize the code's performance. TableQlshows that our 
code yields better performance than the Tornado code for 
dmax smaller than about 1000. Also shown in Tabled is 
the average variable-node degree (k) of the two codes. 
From a practical viewpoint, it is important to design a 
code that yields good performance for small (fc), since 
the physical complexity of a code, which grows with in- 
creasing (fc), limits the hardware implementation of the 
code. For this reason, our code seems to be better suited 
to applications than the Tornado code. 

Another merit of our code is that the iteration number 
required for convergence of decoding is very small. The 
iteration numbers of our code and the Tornado code are 
compared in Fig. Of a) as a function of the erasure prob- 
ability for the case of dmax — 610, which clearly shows 
that our code has a smaller iteration number than the 
Tornado code in the whole region of 6. Even for the 
case of dmax > 1000 where the Tornado code has a little 
higher threshold than our code, the iteration number is 
smaller for our code than for the Tornado code over a 
broad region of S, except near the threshold [Fig. IS^b)]. 
For an early convergence of decoding processes, each node 
needs to gather messages from other nodes quickly. This 
implies that graphs with smaller diameter may be more 
advantageous to reducing the iteration number. This in 
turn suggests that scale-free networks, which are known 
to have a very small diameter d ~ In In TV 22] where 
N is the number of nodes, may require a smaller iter- 
ation number than regular random networks or small- 
world networks 23] . 

The error-correction capability of codes on scale-free 
networks can be further enhanced by adjusting the de- 
gree distribution, especially in the low degree region, so 
that it more closely models realistic scale-free networks 
whose degree distribution does not necessarily follow a 
power law for small k. While doing this, we try to keep 
as small as possible the number of parameters added to 
the generating function. After a number of numerical 
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TABLE II: Performance of codes on scale-free networks 
[Eq. 0]. The same optimization parameters a and 7 as in 
Table are used. 
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simulations we have found the following generating func- 
tion adequate enough for this purpose: 



G{x) = C 



W2P{2)X^ + W3P{3)X^ 



k=4 



(5) 



where p{k) = (fc + a)^'^ . The performance of this code 
is displayed in Tabled which shows that by adding two 
new parameters, W2 and W3, which permit the two lowest 



degrees to vary from the power law, the performance of 
the code is increased. Addition of more parameters is 
expected to give rise to an increased performance, but at 
the expense of rendering the optimization process more 
time consuming. 

In summary, we have found that many high- 
performance LDPC codes possess degree distributions 
well fitted by power-law functions with exponents close 
to 2. Based on this finding, we have developed codes on 
scale-free networks that have very good error-correction 
capabilities. The codes with power-law degree distribu- 
tion yield better performance than those with exponen- 
tial and Gaussian degree distributions that have fast de- 
creasing tails. It also would be interesting to study the 
effect of degree correlations on the performance of a code, 
which is left as future work. As good error-correcting 
codes, the codes on scale-free networks could find lucra- 
tive applications in areas as diverse as wireless commu- 
nication, media and data transfer over the Internet, and 
storage. 



[1] R. Albert and A. L. Barabasi, Rev. Mod. Phys. 74, 47 
(2002). 

[2] M. Faloutsos, P. Faloutsos, C. Faloutsos, Comput. Com- 

mun. Rev. 29, 251 (1999). 
[3] G. Caldarelli, R. Marchetti, and L. Pietronero, Europhys. 

Lett. 52, 386 (2000). 
[4] R. Albert, H. Jeong, and A. L. Barabasi, Nature 401, 

130 (1999). 

[5] B. A. Huberman, L. A. Adamic, Nature 401, 131 (1999). 

[6] M. E. J. Newman, Phys. Rev. E 64, 016131 (2001). 

[7] H. Jeong, B. Tombor, R. Albert, Z. N. Oltvai, and A. L. 

Barabasi, Nature 407, 651 (2000). 
[8] A. L. Barabasi and R. Albert, Science 286, 509 (1999). 
[9] P. L. Krapivsky, S. Redner, and F. Leyvraz, Phys. Rev. 
Lett. 85, 4629 (2000). 
[10] S. N. Dorogovtsev, J. F. F. Mendes, A. N. Samukhin, 

Phys. Rev. Lett. 85, 4633 (2000). 
[11] S. H. Strogatz, Nature 410, 268 (2001). 
[12] C. Berrou and A. Glavieux, IEEE Trans. Commun. 44, 
1261 (1996). 

[13] R. G. Gallager, Low-Density Panty-Check Codes (MIT 

Press, Cambridge, MA, 1963). 
[14] D. J. C. MacKay, IEEE Trans. Inform. Theory 45, 399 

(1999). 

[15] T. J. Richardson and R. L. Urbanke, IEEE Trans. In- 



form. Theory 47, 599 (2001); T. J. Richardson, M. A. 

ShokroUahi, and R. L. Urbanke, ibid. 47, 619 (2001). 
[16] G. D. Forney, Physica A 302, 1 (2001); Special Issue on 

Codes on Graphs and Iterative Algorithms, IEEE Trans. 

Inform. Theory 47, No. 2 (2001). 
[17] M. G. Luby, M. Mitzenmacher, M. A. ShokroUahi, and 

D. A. Spielman, IEEE Trans. Inform. Theory 47, 569 

(2001). 

[18] S. Y. Chung, G. D. Forney Jr., T. J. Richardson, and R. 
Urbanke, IEEE Commun. Lett. 5, 58 (2001). 

[19] A. ShokroUahi, in Proc. 13th Int. Symp. Applied Alge- 
bra, Algebraic Algorithms, and Error-Correcting Codes, 
edited by M. Fossorier, H. Imai, S. Lin, and A. Poli, 
Lecture Notes in Computer Science Vol. 1719 (Springer- 
Verlag, New York, 1999); P. Oswald and A. ShokroUahi, 
IEEE Trans. Inform. Theory 48, 3017 (2002). 

[20] M. E. J. Newman, S. H. Strogatz, and D. J. Watts, Phys. 
Rev. E 64, 026118 (2001). 

[21] D. J. C. MacKay S. T. Wilson, and M. C. Davey IEEE 
Trans. Commun. 47, 1449 (1999). 

[22] R. Cohen and S. Havlin, Phys. Rev. Lett. 90, 058701 
(2003). 

[23] D. J. Watts and S. H. Strogatz, Nature 393, 440 (1998). 



